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DESCRIPTION 

A method for controlling a media content processing device, and a media content 
processing device 

The invention relates to a method for controlling a media content processing device and 
5 to a media content processing device. The invention relates in particular to the 

processing of media contents by means of media rendering devices such as televisions, 
personal computers or radios, or by media storage devices such as video recorders or 
audio recorders. 

10 The term "media content" preferably describes radio and/or television programs such as 
movies, plays, news broadcasts, music chart shows, sports broadcasts, documentaries, 
etc., and can mean an entire unit, for example an entire movie or an entire news 
broadcast, or an excerpt of an entire unit, such as only the regional news segment of a 
news broadcast, or only the top three of the music charts. A media content may have 

15 any type of format, genre, duration and classification. 

For many years now, content processing devices such as video recorders, televisions 
etc., have become an integral part of our daily lives. Even though technical parameters 
such as memory capacity, display features etc., have continuaHy been improved upon 
20 throughout the years, a satisfactory level of ease with which such devices can be 
controlled has still not been attained. 

Some promising suggestions have been made with a view to realising content 
processing devices as intelligent devices which can automatically control or organise 
25 themselves in such a way that the interaction between the user and device, necessary for 
control of the device, is considerably simplified or reduced. However, the suggested 
solutions often capitulate in the face of the fact that the devices usually used in a home 
environment generally do not dispose of the necessary processing power required for 
their realisation. 
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Therefore, an object of the present invention is to provide a method of controlling a 
content processing device, and to provide a content processing device, which aUow 
comfortable interaction between the user and the content processing device. 

The object of the invention is achieved by the features of the independent claims. 
Suitable and advantageous developments of the invention are defined by the features of 
the dependent claims. Further developments of the device claim according to the 
dependent claims of the method claim are also encompassed by the scope of the 
invention. 

In a method according to the invention, it is determined whether a media content to be 
processed is described by a predefined content descriptor from among multitude of 
predefined content descriptors. A device control parameter is then adjusted, depending 
on the content descriptor describing the media content to be processed. A control of a 
media content processing device is carried out in accordance with the device control 
parameter. 

The invention thus allows the control of a media content processing device to be 
automatically adapted to the content type of the media content bemg processed or to be 
processed, whereby the control of the media content processing device can be greatly 
simplified. 

Media content processing devices are generaUy able to carry out numerous control 
sequences. The greater the number of possible control sequences, the more complex is 
the control of such a media content processing device. The invention allows use of 
knowledge of the media content to be processed pertaining to its content type to 
determine in advance which of all possible control sequences would be most suitable 
for dealing with this type of content. Control of the media content processing device 
based on the remaining control sequence(s) is therefore made simpler for the user. 
According to the invention, selection of ^e permitted contirol sequence(s) can be 
effected, m particular, by configuration of the appropriate contirol parameter. 
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Furthermore, the invention allows that complex algorithms used in control of the 
content rendering device be greatly simplified with the aid of content descriptors, 
comprising information, present ahready in the media content, about the content type, as 
additional or supplementary information for control of the media content processing 
device. This simplification means that complex hardware - for example less processing 
power or less memory - is required in order to attain a satisfactory interaction between 
user and content rendering device. 

The term "content descriptor" covers all information suitable for describing a media 
content, e.g.: 

- names of actors, newscasters, presenters, talk-show guests; 

- voices of actors, newscasters, presenters, taUc-show guests; 

- languages of actors, newscasters, presenters, talk-show guests; 

- topics of documentaries, political discussions, sports shows; 

- the topicality or year of production of a broadcast content; 

- key- words or images present in a broadcast content; 

- title of a documentary, movie, political discussion, or sports show; 

- specific program descriptions, e.g. soccer match, rock music show etc,; 

_ program details, e.g. movie with Julia Roberts, news show with Dianne Sawyer 
etc.; 

- genre of the content (sports, news, movie, music show, jazz, 50s movie etc). 

Adjustment of a device control parameter depending on a content descriptor can take 
place in practice via directly interpretable rules or via algorithms that have to be 
computationally evaluated. 

Preferably, the media content to be processed or stored comprises a number of content 
descriptors, preferably determined or identified upon receiving or accessing the media 
content. Content descriptors can for example be supplied along with the media content 
by a provider such as a television broadcast provider. Equally, the content descriptor 
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can for example be broadcast to the media content processing d&vic& by a service 
provider, whereby the content desaiptors are imambiguously assigned to the 
appropriate media content 

5 Additionally or alternatively, a content descriptor can be entered by a user into the 
media content processing device, for example by means of a user interface. A user, 
when programming his video recorder to specify start time, date and channel, can for 
example enter supplementary information about the content type in the form of a 
content descriptor. This can be done by a menu-controlled selection of one of a number 
10 of content descriptors predefined by the video-recorder, or the user can enter a content 
descriptor himself. 

The content descriptor thus entered can alternatively or additionaUy be based on an 
electronic programming guide where the programs are classified according to content 
15 type, e.g. NexTView. 

In a particularly preferred embodiment, a content descriptor is extracted from the media 
content using known methods of analysis. For example, keywords can be extracted 
from the media content using methods of speech recognition, or specific voices can be 
20 identified in the media content by the use of speaker identification methods. 

The media content processing device preferably comprises a content rendering device 
or is itself a content rendering device such as, for example, a television, where the 
device control parameter controls the content rendering. Here, rendering of a content 
25 means presenting video content as video images on the screen, or converting audio 
content to audible sound. 

The device control parameter preferably controls the volume of the content rendering 
device, such as the volume of a television set. For example, the volume might be made 
30 louder for a sports program to create a stadium atmosphere, quieter for music programs 
to avoid disturbing any neighbours; louder for movies that feature a lot of dialogue; 
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quieter for action movies or action scenes with loud, possibly irritating, sound effects 
such as explosions or collisions accompanied by loud music soundtracks. 

In a particularly preferred embodiment, a function unit, for example a user interface or 
an automatic speech or speaker recognition unit, of the content processing device is 
configured with the aid of the device control parameter. The reaction (or behaviour) of 
this function unit in response to specific input parameters, in particular the output of 
output parameters or combinations of output parameters as a function of input 
parameters or combinations of input parameters, can thus be influenced by the 
configuration of this function unit. In this way the output parameters of the function 
unit are "indirectiy" controlled by the control parameters based on the content 
descriptors. 

This function unit preferably comprises a user interface or is part of a user interface, so 
that the device control parameta:, by configuring the appropriate control unit, controls 
the interaction between the user and the content rendering device. 

For example, the functionality of the off-switch of the television device may be adapted 
to the content type - during a 'normal' program, the television device will immediately 
switch off when the off-switch is being pressed, however, when the off-switch is 
pressed during a news program, the television device will stay on until the end of the 
news program, when it wiU automatically switch off. The user may define the desired 
reaction of the television device to the off-switch depending on the different content 
types. 

In a further preferred embodiment of the invention, the type of program defines the 
output modality that liie system or the systems' user interface uses to interact with the 
user. During a video oriented program (e.g. sports, action movie), the system chooses to 
interact with the user via audio signals (sounds or speech synthesis) in order not to 
interrupt the more important video part. During an audio oriented program (e.g. news, 
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comedy), the system may choose to interact with the user via video output (on-screen 
display) in ordesr not to interrupt the more important audio part. 

In a particularly preferred embodiment of the invention, the device control parameter 
controls the reaction of a content rendering device to remote control commands. This 
embodiment is equivalent to a solution whereby a device control parameter controls the 
association of the buttons on a remote contiol with functions of a media content 
processing device, i.e. it configures the way in which the media content processing 
device is remotely contiroUed, so that this embodiment also Hes within the scope of the 
invention. 

In the f oUowing, three preferred examples for the content type dependent reaction of a 

television device to remote control commands are described: 

Audio information might suffice during a news program. Therefore, switching 
channels only results in switching the video, while the audio still stays on tiie 
news channel. This enables browsing tiie other channels while still being 
informed about the news. 

Video information might be sufficient during a ^orts program Therefore, 
switching channels only results in switching the audio, while the video still 
stays on the sports channel. This enables browsing the other channels while still 
having all the information about the ongoing game. (Another alternative for 
sports programs is to automatically activate the 'picture-in-picture' function, 
when channels are being switched.) 

A 'context' button activates Ihe provision of additional information. The type of 
additional information depends on the type of the content being watched. For a 
news program, the 'context' button results in additional background about the 
current news item. During a movie, the 'context' button provides information 
about the actors. During a sports program, the 'context' button provides updated 
information about other ongoing games. 
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In an equally preferred embodiment of the invention, the function unit configured by 
the associated device control parameters comprises a speech recognition device or a 
speaker identification device or is part of a speech recognition device or a speaker 
identification device, so that the device control parameter ultimately controls a speech 
recognition method or a speaker identification method. The device control parameter 
can, for example, define a speech recognition vocabulary or a speech recognition 
grammar. By adapting the configuration of the speech recognition device or a speaker 
identification device to tiie current media content being processed, a speech recognition 
method or a speaker identification method can be carried out in a more effective 
manner, i,e. even a relatively simple hardware configuration can attain good recognition 
performance. 

In preferred embodiments of the invention, the device control parameters, in addition to 
or as an alternative to recognition vocabulary or recognition grammar, determine one or 
more of the following characteristics of speech recognition or speaker recognition 
methods: 

~ Speech understanding grammar 

- Dialogue description (for interaction between the user and the device) 

Acoustic models for the speech recognizer 

Language models for the speech recognizer 
. Pruning thresholds (for the speech recognition decoding process) 
. Confidence thresholds (for the decision making process within the device) 

The speech recognition process or the speaker identification process can be applied to 
search the audio information of the current media content to be processed for keywords, 
or for pre-determined speakers, and to further process the appropriately categorised 
content, for example by storing the appropriately categorised media content 

A media content processing device according to the present invention comprises a 
content descriptor detection arrangement, configured in such a way as to detect whether 
a media content to be processed is described by a predefined content descriptor or by 
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several predefined content descriptors. A control unit is configured such that a device 
control parameter is adjusted, depending on the content descriptor that describes the 
media content to be processed. A control of the media content processing device is 
carried out in accordance with this device control parameter. 

5 

The content descriptor detection arrangement can be realised as a content analysis unit, 
which extracts one or more content descriptors from the media content to be processed, 
or can be part of a receiver or storage access device, or may work together with a 
receiver or memory access device that can detect a content descriptor associated with 
10 the media content, for example as an accompanying signal. The content descriptor 

detection arrangement can however also operate in conjunction with a user interface or 
can part of a user interface that converts user input into correspondmg content 
descriptors. 



1 5 Other objects and features of the present invention will become apparent from the 
following detailed descriptions considered in conjunction with the accompanying 
drawing. It is to be understood, however, that the drawings arc designed solely for the 
purposes of illustration and not as a definition of the limi ts of the invention. 

20 Fig. 1 is a block diagram of the system architecture of a content processing device with 
a remote control module; 

Fig. 2 is a process sequence of a method for controlling a content processing device. 

25 The individual components of a media content processing device 1 are described in 
more detail with the aid of the figures, as well as the steps of an exemplary method for 
controlling a media content processing device 1. 

For the sake of clarity, only those components of a media content processing device 1 
30 necessary for an understanding of the invention are shown in the figures. It goes 
without saying that a media content processing device 1 also comprises any 
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components that are usually found in such processing systems, for example any 
necessary cables or connections, processors, power supplies, switching elements or bus 
systems. 

5 Figure 1 shows a media content processing device 1, such as an intelligent home 
entertainment centre, and, belonging thereto, a remote control 9 with a suitable 
interface, e.g. an infra-red interface. 

The media content processing device 1 shown in Fig. 1 incorporates a receiver 
10 arrangement 2, constracted in a way suitable for receiving media contents incoming via 
a broadcast channel 10. 

The speaker recognition device or the speech recognition device 3, which can be 
realised by means of a progranmiable processor, is able to recognise predefined 
15 keywords or specific voices in the received media content. 

A content storage unit 4, realised e.g. in the form of a hard-disk, can be used to store 
media contents, perhaps according to pre-defined rules. 

20 The content rendering device 5 can comprise a display unit or loudspeaker arrangement 
for rendering or replaying received or stored media contents. 

The components 2, 3, 4, 5 of the media content processing device 1 thus briefly 
described are connected in some way to a content descriptor detection unit 6, 
25 comprising a programmable processor. This is configured or constructed for the 
detection of content descriptors which describe the media content currently being 
processed. The content descriptors are extracted using suitable analysis methods from 
the media content, from a signal accompanying the received signal, or from information 
input by the user via the user interface 7. 

30 
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According to one realisation, the speech recognition device or speaker recognition 
device 3, as part of the content descriptor detection unit 6, can be applied to extract 
content descriptors such as key-words or speaker voices from the media content being 
processed. 

5 

The content descriptor(s) CD, detected by the content descriptor detection unit 6 and 
describing the current media content to be processed, are forwarded to a control unit 8. 
The control unit 8, which might also be realised as a programmable processor, controls 
the media content processing device 1, various components 2, 3, 4, 5 of the media 
10 content processing device 1 and the interaction between these components 2, 3, 4, 5. 

For example, in a memory unit, being a component of the control unit 8 and not shown 
in the figure, associations between content descriptors CDl, CD2 and values of various 
control parameters Pll, P12, P21, P22 are stored. The detected content descriptors(s) 
15 CD are converted according to these associations to the appropriate control parameter P 
or control parameters in the control unit 8. 

The control parameter P or derived control signals are then forwarded by the control 
unit 8 to the components 2, 3, 4, 5 described above, in order to control the components 
20 2, 3, 4, 5 of the media content processing device 1, thereby controlling the media 
content processing device 1, 



The approach thus described allows, for instance, the following applications: 

Control of a speech recognition device 3 in accordance with the media content 

25 to be processed: depending on which content descriptors are detected, control 

parameters are adjusted for the speech recognition device 3, such as, for 
example, pruning thresholds, so that the speech recognition device is configured 
in accordance with the content type. The media content to be processed can 
either be received by the receiver arrangement 2, or read from the content 

30 storage unit 4. f 
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Control of a content storage unit 4 in accordance witli the media content to be 
processed: depending on which content descriptors are detected, control 
parameters are adjusted to control the content storage unit 4. 
Control of a content rendering device 5 in accordance with the media content to 
5 be processed: depending on which content descriptors are detected, control 

parameters are adjusted for the content rendering device 5, such as to directly 
control the volume level or to configure £in appropriate function unit of the 
device 5 to influence the reaction of the content rendering device 5 to the remote 
control device 9. 

10 

The media content processing device 1 can be realised as part of a stand-alone device in 
the vicinity of the user, or may be distributed so that for example the receiver 
arrangement 2, the speech recognition device 3 or the speaker recognition device 3 and 
the content storage unit are realised as network elements of a broadcast provider or 
15 other provider, and the content rendering device 5 is located in the vicinity of the user. 
The individual components 2, 3, 4, 5, 6, 7 of the media content processing device 1 can 
each comprise a number of processors, or can share one or more processors. 

Fig. 2 shows a flow chart of a method for content type controlled interaction between a 
20 media content processing device 1 and a user. 

In a first step, a media content detection arrangement 6 detects content descriptors CD 
for determination whether an audio/video input VI, such as a movie or news program, is 
predominantly video-based or predominantly audio-based, i.e. whether the media 
25 content itself avails of predominantly video information (e.g, sports program, action 
movie) or predominantly audio information (e.g. news program, comedy show) in 
conveying information. The content descriptors CD are sent to a control unit. 

Depending on whether the media content is video-based or audio-based, the control unit 
30 8 sends control parameters A, V to an information output rendering module 1 1 of, for 
example, a TV device. 
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The user now requests the output of information which he requires - for example, for 
programming the media content processing device 1 - from the media content 
processing device 1 via a user interface, comprising a remote control 9. The information 
5 output rendering module 1 1 or another function unit (not shown), being the internal part 
of the user interface, is configured based on the control parameters A, V. In this way, 
the presence of a predominantly audio-based content results in a video-based output VO 
of the requested information by means of, for example, the TV screen, while the audio 
part of the incoming media content is further presented to the user without undergoing 
10 any interruption. The presence of a predomiuemtly video-based media content results in 
an audio-based output AO of the requested information over the loudspeaker 
arrangement of the TV device, while the video part of the incoming media content is 
further presented to the user without undergoing any interruption. 

15 In such an example, the user can also continue to watch a sports show broadcast on one 
channel, not missing any of the action, whilst listening to the news broadcast on an 
other channel. 

The example described above can be realised in practice also in such a way that the 
20 content descriptor detection unit 6 forwards the detected content descriptors directly to 
the output rendering module 1 1, which encompasses an appropriate control unit. The 
content descriptors are then converted to appropriate control parameters in the control 
unit. The control parameters in turn control the output rendering module 1 1 in such a 
way that the information requested by the user is rendered by adaptation to the media 
25 content currently being processed or rendered. 

Although the present invention has been disclosed in the form of preferred 
embodiments and variations thereon, it will be understood that numerous additional 
modifications and variations could be made thereto by a person skilled in the art, 
30 without departing from the scope of the invention. For example, the content processing 
device 1 may comprise only one of the components 2, 3, 4, 5 described, or any 
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combination of the components 2, 3, 4, 5 described. Also, the content processing device 
1 might be incoiporated partially or entirely in a personal computer. 

For the sake of clarity, it is also to be understood that the use of "a" or "an" throughout 
5 this application does not exclude a pluraUty, and "comprising" does not exclude other 
steps or elements, A "unit" or "module" may comprise a number of blocks or devices, 
unless explicitly described as a single entity. The term "hardware" can mean digital or 
analogue hardware, and might mean any type of circuitry such as boards, integrated 
circuits, off-the-shelf modules, custom modules etc. 

10 
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CLAIMS 



1. A method for controlling a media content processing device (1), 

where a multitude of content descriptors (GDI, CD2) are pre-defined, 
where it is determined whether a media content (VI) to be processed is 
described by a pre-defined content descriptor (GDI, GD2), 
5 - where a device control parameter (Pll, P12, P21, P22) is automatically adjusted 
based on the content descriptor (GDI, GD2) which describes the media content 
(VI) to be processed, and 

where the media content processing device (1) is automatically controlled, 
based on tlie device control parameter (Pll, P12, P21, P22), 

10 

2. A method according to claim 1, 

where a content descriptor (GDI, GD2), describing a media content (VI) to be 
processed, is entered by a user. 

15 3. A method according to any of the preceding claims, 

where a media content (VI) to be processed comprises, as an accompanying signal, a 
content descriptor (GDI, GD2) describing the media content (VI) to be processed, 

4. A method according to any of the preceding claims, 
20 where a content descriptor (GDI, CD2) describing the media content (VI) to be 
processed, is extracted from a media content (VI) to be processed. 
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5. A metliod according to any of tlie preceding claims, 

where the media content processing device (1) comprises a content rendering device 
(5), and the device control parameter (P11,P12,P21,P22) controls the content rendering. 

5 6. A method according to claim 5, 

where the device control parameter (Pll5P12,P21,P22) controls the volume of the 
content rendering device (5). 

7. A method according to any of the preceding claims, 

10 where the device control parameter (PI 1,P125P21,P22) configm^es a function unit of the 
media content processing device (1) to control the reaction of this function unit in 
response to specific input parameters. 

8. A method according to claim 7, 

15 where the function unit comprises a user interface or is part of a user interface, and the 
device control parameter (Pll, P12, P21, P22) controls the interaction between the user 
and the media content processing device (1). 

9. A method according to claim 8, 

20 where the device control parameter (Pll, P12, P21, P22) controls the response of the 
media content processing device (1) to remote control commands. 

10. A method according to any of the claims 7 to 9, 

where the function unit comprises a speech recognition device (3) or a speaker 
25 identification device (3) or is part of a speech recognition device (3) or a speaker 
identification device (3), and the device control parameter (Pll, P12, P21, P22) 
controls a speech recognition process or a speaker identification process. 
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1 1. A method according to any of the preceding claims, 

where the relationship between device control parameter (P11,P12,P21,P22) and 
content descriptor (CDl, CD2) can be configured by the user. 

5 12. Media content processing device (1) 

with a content descriptor detection arrangement (6), configured for determining 
whether a media content (VI) to be processed is described by a predefined content 
descriptor (CDl, CD2) of a multitude of predefined content descriptors (CDl, 
CD2), 

10 - with a control unit (8), configured 

such that a device control parameter (Pll, P12, P21, P22) is adjusted based on the 
content descriptor (CDl, CD2) describing the media content (VI) to be processed, 
and 

such the media content processing device (1) is automatically controlled based on 
15 the device control parameter (Pll, P12, P21, P22). 
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ABSTRACT 

Method for controlling a media content processing device, and a media content 
processing device 

The invention describes a method for controlling a media content processing device (!)• It 
5 is thereby determined whether a media content (VI) to be processed is described by a pre- 
defined content descriptor (GDI, CD2) from a multitude of pre-defined content 
descriptors (GDI, CD2). A device control parameter (Pll, P12, P21, P22) is automatically 
adjusted based on the content descriptor (GDI, GD2) which describes the media content 
(VI) to be processed. Then, the media content processing device (1) is automatically 
10 controlled, based on the device control parameter (Pll, P12, P21, P22). 
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